A Study of Building an Reverse Dictionary
نویسنده
چکیده
A traditional forward dictionary is where the user inputs a word and gets one or more definitions as output. Whereas, in a reverse dictionary the user enters the desired phrase with logic and gets a certain number of words as outputs similar to the concept of the user input phrase. The reverse dictionary addresses the wide spread problem of knowing the meaning of a word, but unable to recall the appropriate word on demand .In this paper introduce a broad overview of the reverse dictionary and what are the techniques is used in reverse dictionary, some limitation of the reverse dictionary. Keywords—Reverse Dictionary, Pos Tagging, Concept mining, stemming INTRODUCTION A reverse dictionary can be viewed as an organized collection of concepts, word meanings or phrases and this is in contrast to a forward dictionary. So its function is similar to that of a thesaurus where one can look up a phrase by a general word or find a similar word or synonyms to the input word. The user enters the desired phrase with logic and gets a certain number of words as outputs similar to the concept of the user input phrase. The user input may be matching the definition that is already present or not. Such a reverse dictionary would be useful for linguists and poets who might be looking for words ending with a particular suffix, or by an anthropologist or forensics specialist examining a damaged text (e.g. a stone inscription, or a burned document) that had only the final portion of a particular word preserved. Reverse dictionary of this type have been published for most major alphabetical languages. By way of contras, in a standard dictionary words are organized such that word and subsequent letters proceeding toward the end of that word. A reverse dictionary performs a reverse mapping i.e., given a phrase describing a desired concept. It provides words whose definitions match the entered definition phrase, as opposed to a regular dictionary that maps words to their definitions. For example, a forward dictionary informs the user that the meaning of the word “regret” is “to feel sorry.” Whereas reverse dictionary, offers the user an opportunity to enter the phrase “feeling of loss or longing for someone” as input, and can be expected to receive the word “regret” and possibly other words with similar meaning as output. Most of the techniques for the creation of reverse dictionary is based on the creation of multiple databases for synonyms, hyponym, antonym etc. This paper is organized as follows. In Section II, we present the system study of the Reverse Dictionary (RD). In section III, the previous works related to this research are reported. In section IV, we present the problem description. Conclusion is presented in the final section V. SYSTEM STUDY In this section we describe about what is reverse dictionary and structure of the reverse dictionary and how it is works. A. History of Reverse Dictionary A reverse dictionary is a dictionary organized in a non-standard order (usually referring to being in a so called "reverse" order) that provides the user with information that would be difficult to obtain from a traditionally alphabetized dictionary. Reverse dictionaries were historically difficult to produce before the advent of the electronic computer and have become more common since 1974 when Stahl and Scavnicky's Reverse Spanish Dictionary appeared (the first computer sorted reverse dictionary ever published). The reverse word dictionary refers to a dictionary where the word entries in the dictionary are not alphabetized in the same manner as traditional dictionary. For example, A Reverse Dictionary of the Spanish Language and Walker's Rhyming Dictionary are reverse dictionaries, the organization of which is based upon sorting each entry word upon its last letter and the subsequent letters proceeding toward the beginning of that word. Consequently, in these reverse dictionaries all words that have the same suffix appear in order in the dictionary. Such a reverse dictionary would be useful for linguists and poets who might be looking for words ending with a particular suffix, or by an anthropologist or forensics specialist examining a damaged text (e.g. a stone inscription, or a burned document) that had only the final portion of a particular word preserved. Reverse dictionaries of this type have been published for most major alphabetical languages (see numerous examples listed below). By way of contrast, in a standard dictionary words are organized such that words with the same prefix appear in order, since the sorting order is starting with the first letter of the entry word and subsequent letters proceeding toward the end of that word. Some sort utilities can sort on a reversed version of the sort key, making it easy to generate reverse dictionaries on the fly if the dictionary data is available in electronic form. International Journal of Scientific and Research Publications, Volume 5, Issue 7, July 2015 2 ISSN 2250-3153 www.ijsrp.org Another use of the term "reverse dictionary" is for a reference work that is organized by concepts, phrases, or the definitions of words. This is in contrast to a standard dictionary, in which words are indexed by the headwords, but similar in function to a thesaurus, where one can look up a concept by some common, general word, and then find a list of near-synonyms of that word. (For example, in a thesaurus one could look up "doctor" and be presented with such words as healer, physician, surgeon, M.D., medical man, medicine man, academician, professor, scholar, sage, master, expert.) In theory, a reverse dictionary might go further than this, allowing you to find a word by its definition only (for example, to find the word "doctor" knowing only that he is a "person who cures disease"). Such dictionaries have become more practical with the advent of computerized informationstorage and retrieval systems (i.e. computer databases). B. Architecture for a Reverse Dictionary Fig., 1: Architecture for a Reverse Dictionary The general steps involved in most of the reverse dictionary systems available today. The following steps as follows a) Stemming Steaming is the process of finding the root word of a word contained in a given phrase. Normally stemming is accomplished by means of porter stemmer. b) POS Tagging Part-of –Speech Tagging is the process of each word in a sentence is partitioned into the each word (and other token), such as noun, verb, adjective, etc. pos tagger is piece of software which performs pos tagging. English taggers us the Penn Tree tag set. c) Concept Mining Concept mining is the process of extracting meaning from a sentence or a set of words. Normally the conversion of words to concepts has been performed using a thesaurus. A number of methods are available now days to extract meaning from a given phrase or sentence. d) Forward Dictionary Look Up The meaning extracted after the concept mining phase can be used and consults a forward dictionary to selects those whose definitions are similar to this concept. e) Ranking Candidate Words After the forward dictionary look up phase a set of words whose meaning similar to the input phrase can be identified and this words has to be ordered by means of some probabilistic methods. PREVIOUS WORKS In a reverse dictionary a user phrase is given and we receive certain number of words as output ranked with an algorithm. The related works to this reverse dictionary are the reverse dictionaries [1] [2]. INPUT PHRASE Reverse Dictionary System OUTPUT WORDS Stemming POS Tagging Concept Mining ,FD Look up Ranking Candidate Words International Journal of Scientific and Research Publications, Volume 5, Issue 7, July 2015 3 ISSN 2250-3153 www.ijsrp.org Reference[3] Deals with the design and implementation of a reverse dictionary It described finding candidate words from a forward dictionary data source for user phrase. Candidate words are ranked by quality of match. It consists of two key sub steps build the RMS and Query the RMS. Some Limitation of this paper as follows The response efficiency is not to be similar to forward dictionary online lookups. • There is no enough words or less than the target number of words, we can extend the search using Synonyms, hypernyms, hyponyms. • Database is not updated dynamically. • The lack of contextual information. Reference [4] described Input phrase is analysed using semantic similarity measures and SVM assisted by Word Net. Applying algebraic analysis to select candidate words and rank them. The similarity and relatedness measures are Jiang and Conrath (JCN) and Lin. Semantic similarity measure between each member of the input is ranked and highest values words are chosen. Some Limitation of this paper as follows • Context Based target words are not achieved properly. • Only nouns as word members of the semantic space. • There is no Dynamic updation of reverse dictionary database. Kesavaram.P.H [5] described the search is enhanced by considering the synonyms, hypernyms and hyponyms of that particular word. The final step is to sort out the results based on its rank. Some Limitation of this paper as follows Inflected forms of the words are not considered. For example: Root word “sleep” is considered from “sleeplessness”. Spell checker is not provided. PROBLEM DESCRIPTION In Reverse Dictionary, similarity is calculated based on sentence phrase which is effective as compared with similarity based on each word. Context Based target words are not achieved properly. And only nouns as word members of the semantic space, There is no Dynamic updation of reverse dictionary database. The drawback of [4] is overcome from this proposed system. This paper is in proceeding Each word in a sentence is partitioned into a list of tokens words using Part-Of-Speech Tree tagger. • Here Porter stemming algorithm is used. Porter stemming is a process of removing the common morphological and inflexional endings of words. • The similarity of the sentences based on the similarity of the pairs of words is computed. • Most appropriate sense for every word in a sentence can be found using Word Sense Disambiguation. • List of possible target words are given as suggestion in addition to the required word.
منابع مشابه
An Investigation into Bilingual Dictionary Use: Do the Frequency of Use and Type of Dictionary Make a Difference in L2 Writing Performance?
Bilingual dictionary use in L2 writing test performance has recently been the subject of debate. Opinions differ according to how the trait is understood and whether the system favors the process-oriented or product-oriented views towards the assessment and writing skill. Given the need for more empirical support, this study is aimed at investigating the availability of bilingual dictionary use...
متن کاملStudy of thermal performance of building roofs in the city of Tehran
The design of a building can provide the highest thermal comfort in the interior without any mechanical equipment and save energy to a large extent. The roof of a building is an important part for thermal loss. This research studies the thermal performance of 14 conventional roof structures in Tehran city by using designbuilder 4.5. It is found that the polystyrene block performs best compared ...
متن کاملAn Efficient Double Skin Façade for an Office Building in Shiraz City
Energy efficiency in office buildings has been the center of attention for many researches. This special attention is due to highly energy consumption in this building type. Refinement of facade and building’s envelop is a good approach to reduce buildings energy requirements. Double skin facade concepts are commonly used to achieve that object. Although the concept is not new, there is a growi...
متن کاملA New Dictionary Construction Method in Sparse Representation Techniques for Target Detection in Hyperspectral Imagery
Hyperspectral data in Remote Sensing which have been gathered with efficient spectral resolution (about 10 nanometer) contain a plethora of spectral bands (roughly 200 bands). Since precious information about the spectral features of target materials can be extracted from these data, they have been used exclusively in hyperspectral target detection. One of the problem associated with the detect...
متن کاملStudy and Recognition of Muslim Sage Abdullah Azdi and His Medical Dictionary Called “Kitāb Al-ma”
This study seeks to identify one of the pioneers of traditional clinical medicine named Abdullah Azdi and his medical dictionary. This research is an analytical study. The focus of the search was on two keywords, Abdullah Azdi and Kitab al-Ma'ma, but the scope of the search included all appropriate terms such as: medicine, Bu Ali Sina, traditional medicine, medical dictionary, ethics, and medic...
متن کاملEFL Translation Students' Perspective toward Using Bilingual Dictionary in Translation of Polysemous Words
This research presented the use of bilingual dictionary and addressed the EFL translation students' points of view on the use of bilingual dictionary in translating polysemous words (English to Persian). Moreo- ver, it aimed at finding the possible relationship between the effect of using bilingual dictionary by stu- dents in translating polysemous words and their achieved scores. In the study ...
متن کامل